Linear discriminant analysis: A detailed tutorial

نویسندگان

  • Alaa Tharwat
  • Tarek Gaber
  • Abdelhameed Ibrahim
  • Aboul Ella Hassanien
چکیده

Linear Discriminant Analysis (LDA) is a very common technique for dimensionality reduction problems as a preprocessing step for machine learning and pattern classification applications. At the same time, it is usually used as a black box, but (sometimes) not well understood. The aim of this paper is to build a solid intuition for what is LDA, and how LDA works, thus enabling readers of all levels be able to get a better understanding of the LDA and to know how to apply this technique in different applications. The paper first gave the basic definitions and steps of how LDA technique works supported with visual explanations of these steps. Moreover, the two methods of computing the LDA space, i.e. class-dependent and class-independent methods, were explained in details. Then, in a step-by-step approach, two numerical examples are demonstrated to show how the LDA space can be calculated in case of the class-dependent and class-independent methods. Furthermore, two of the most common LDA problems (i.e. Small Sample Size (SSS) and non-linearity problems) were highlighted and illustrated, and stateof-the-art solutions to these problems were investigated and explained. Finally, a number of experiments was conducted with different datasets to (1) investigate the effect of the eigenvectors that used in the LDA space on the robustness of the extracted feature for the classification accuracy, and (2) to show when the SSS problem occurs and how it can be addressed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A tutorial for Discriminant Analysis of Principal Components (DAPC) using adegenet 1.3-6

This vignette provides a tutorial for applying the Discriminant Analysis of Principal Components (DAPC [1]) using the adegenet package [2] for the R software [3]. This methods aims to identify and describe genetic clusters, although it can in fact be applied to any quantitative data. We illustrate how to use find.clusters to identify clusters, and dapc to describe the relationships between thes...

متن کامل

A tutorial for Discriminant Analysis of Principal Components (DAPC) using adegenet 1.4-0

This vignette provides a tutorial for applying the Discriminant Analysis of Principal Components (DAPC [1]) using the adegenet package [2] for the R software [3]. This methods aims to identify and describe genetic clusters, although it can in fact be applied to any quantitative data. We illustrate how to use find.clusters to identify clusters, and dapc to describe the relationships between thes...

متن کامل

A tutorial for Discriminant Analysis of Principal Components (DAPC) using adegenet 1.3-0

This vignette provides a tutorial for applying the Discriminant Analysis of Principal Components (DAPC [1]) using the adegenet package [2] for the R software [3]. This methods aims to identify and describe genetic clusters, although it can in fact be applied to any quantitative data. We illustrate how to use find.clusters to identify clusters, and dapc to describe the relationships between thes...

متن کامل

Using recursive regression to explore nonlinear relationships and interactions: A tutorial applied to a multicultural education study

This paper discusses how a seldom-used statistical procedure, recursive regression (RR), can numerically and graphically illustrate data-driven nonlinear relationships and interaction of variables. This routine falls into the family of exploratory techniques, yet a few interesting features make it a valuable compliment to factor analysis and multiple linear regression for method triangulation. ...

متن کامل

Single-trial analysis and classification of ERP components - A tutorial

Analyzing brain states that correspond to event related potentials (ERPs) on a single trial basis is a hard problem due to the high trial-to-trial variability and the unfavorable ratio between signal (ERP) and noise (artifacts and neural background activity). In this tutorial, we provide a comprehensive framework for decoding ERPs, elaborating on linear concepts, namely spatio-temporal patterns...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • AI Commun.

دوره 30  شماره 

صفحات  -

تاریخ انتشار 2017